Special Publication 500-225, Apr. 1995, pp. 95-104.

Voorhees et al., "Learning Collection Fusion Strategies", Proceedings of SIGIR '95, 
Jul. 1995, pp. 172-179. 

ART-UNIT: 271 

PRIMARY-EXAMINER: Lintz; Paul R. 
ATTY-AGENT-FIRM: Ahmed; Adel A.

ABSTRACT:

A method implemented on a computer for facilitating World Wide Web Searches and like 
database searches by combining search result documents, as provided by separate 
search engines in response to a query, into one single integrated list so as to 
produce a single document with a ranked list of pages, includes the steps of: (a) 
training the computer for each search engine by clustering training queries and 
building cluster centroids; (b) assigning weights to each cluster reflecting the number 
of relevant pages expected to be obtained by this search engine for queries similar 
to those in that cluster; (c) processing an incoming query by selecting, for each 
search engine, that cluster centroid that is most similar to the incoming query and 
returning the weight associated with the selected cluster as the weight of the 
current search engine; and (d) apportioning the N slots in the retrieved set 
according to the weights returned by each search engine. 
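
Steps (c) and (d) of the abstract can be sketched as follows. This is a minimal 
illustration, not the patented implementation. It assumes the training of steps (a) 
and (b) has already produced, for each search engine, a list of cluster centroids 
with weights. The engine names, centroid contents, weights, the use of cosine 
similarity, and the proportional largest-remainder rounding are all assumptions made 
for the example; the abstract only says that the N slots are apportioned according 
to the weights.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term-frequency vector for a query (stand-in for a centroid too)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def engine_weight(query_vec, clusters):
    """Step (c): return the weight of the centroid most similar to the query.

    clusters is a list of (centroid_vector, weight) pairs for one engine.
    """
    _, weight = max(clusters, key=lambda c: cosine(query_vec, c[0]))
    return weight


def apportion(weights, n):
    """Step (d): split n slots in proportion to the engine weights.

    Proportional sharing with largest-remainder rounding is an assumption.
    """
    total = sum(weights.values())
    if total <= 0:  # no engine expects relevant pages: share evenly
        weights = {e: 1.0 for e in weights}
        total = float(len(weights))
    exact = {e: n * w / total for e, w in weights.items()}
    slots = {e: int(x) for e, x in exact.items()}
    leftover = n - sum(slots.values())
    by_remainder = sorted(exact, key=lambda e: exact[e] - slots[e], reverse=True)
    for e in by_remainder[:leftover]:
        slots[e] += 1
    return slots


# Hypothetical trained clusters: (centroid, expected number of relevant pages).
CLUSTERS = {
    "engine_a": [(vectorize("python programming language"), 6.0),
                 (vectorize("cheap flights hotels"), 1.0)],
    "engine_b": [(vectorize("python snake reptile"), 2.0),
                 (vectorize("flights travel deals"), 3.0)],
}

query_vec = vectorize("python language tutorial")
weights = {e: engine_weight(query_vec, c) for e, c in CLUSTERS.items()}
slots = apportion(weights, 8)
```

Here each engine contributes the weight of its closest centroid, and the eight result 
slots are divided in proportion to those weights. In a real system the centroids 
would be averages of clustered training-query vectors rather than single phrases.
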
BASIC-ABSTRACT:

NOVELTY - Map data and a web site description corresponding to the data fields of a 
search engine (210) are stored in a database (222) in HTML form. The method of 
registering the web site with each engine is also stored in the database. The web 
site is registered by transmitting the registration method data and mapping the web 
site data to each search engine.

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for a web site 
registration apparatus for registering a web site with several search engines.

USE - For the World Wide Web.

ADVANTAGE - Automatic registration of web sites with search engines overcomes the 
difficulty of registering a web site through a search engine's strangely formatted 
registration page.

DESCRIPTION OF DRAWING(S) - The figure shows components in a networked computer 
system.
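
The mapping idea in the NOVELTY paragraph can be illustrated with a small sketch. It 
assumes one stored description of the web site and, per search engine, a stored 
registration method plus a map from site data fields to that engine's form fields. 
All engine names, URLs, and field names below are hypothetical, and the sketch only 
builds the request that would be transmitted; it does not contact any server and 
does not reproduce the HTML storage format mentioned in the abstract.

```python
from urllib.parse import urlencode

# One canonical description of the site (hypothetical values).
SITE = {
    "url": "http://www.example.com/",
    "title": "Example Site",
    "email": "webmaster@example.com",
}

# Per-engine registration method and field map (all hypothetical).
ENGINES = {
    "engine_one": {
        "method": "GET",
        "submit_url": "http://engine-one.example/addurl",
        "fields": {"url": "u", "email": "contact"},
    },
    "engine_two": {
        "method": "POST",
        "submit_url": "http://engine-two.example/register",
        "fields": {"url": "site_url", "title": "site_title", "email": "owner"},
    },
}


def build_registration(site, engine):
    """Map site data onto one engine's form fields and build the request.

    Returns (method, url, body); body is None for GET requests.
    """
    payload = {
        engine_field: site[site_field]
        for site_field, engine_field in engine["fields"].items()
    }
    encoded = urlencode(payload)
    if engine["method"] == "GET":
        return "GET", engine["submit_url"] + "?" + encoded, None
    return "POST", engine["submit_url"], encoded


requests_to_send = {
    name: build_registration(SITE, engine) for name, engine in ENGINES.items()
}
```

Keeping the field map as data means a new engine with an unusual registration page 
needs only a new entry in the table, not new code.
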
QuarterDeck, URL: http://arachnid.qdeck.com/qdeck/products/webcompass.
Towell et al., "Learning Collection Fusion Strategies for Information Retrieval", 
Proceedings of the 12th Annual Machine Learning Conference, Jul. 1995, pp. 540-548. 
Voorhees et al., "The Collection Fusion Problem", Proceedings of TREC-3, NIST 
Special Publication 500-225, Apr. 1995, pp. 95-104. 

Voorhees et al., "Learning Collection Fusion Strategies", Proceedings of SIGIR '95, 
Jul. 1995, pp. 172-179. 

ART-UNIT: 271 

PRIMARY-EXAMINER: Lintz; Paul R. 
ATTY-AGENT-FIRM: Ahmed; Adel A. 

ABSTRACT:

A computer-implemented method for facilitating World Wide Web Searches and like 
database searches by combining search result documents, as provided by separate 
search engines in response to a query, into one single integrated list so as to 
produce a single document with a ranked list of pages, by forming a set of selected 
queries, the queries including respective terms, for which selected queries 
relevance data from past data is known, herein referred to as training queries, in a 
vector space comprising all training queries, the relevance data comprising 
judgments by a user as to whether a page is appropriate for a query which retrieved 
it. Further steps in the method are identifying a set of k most similar training 
queries to current query q, computing an average relevant document distribution of 
the k queries within the training queries' search results for each of the search 
engines, using the computed relevant document distributions, finding an optimal 
number of pages to select from the result set of each search engine when N total 
pages are to be retrieved, and creating a final retrieved set by forming the union 
of the top λ_s pages from each search engine. 
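
The steps in this abstract can be sketched in code. The sketch below is an 
illustration under stated assumptions, not the patented procedure: queries are 
compared with cosine similarity over bag-of-words vectors, each training query 
stores 0/1 relevance judgments by rank for each engine, and "optimal" is taken to 
mean the cutoffs λ_s (written lambda_s in the code) that sum to N and maximize the 
expected number of relevant pages, found here by a small dynamic program. The data 
layout and all function names are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def nearest_queries(query, training, k):
    """Return the k training queries most similar to the current query."""
    qv = Counter(query.lower().split())
    def sim(t):
        return cosine(qv, Counter(t["query"].lower().split()))
    return sorted(training, key=sim, reverse=True)[:k]


def average_distribution(neighbors, engine, depth):
    """Mean relevance at ranks 1..depth for one engine over the neighbors."""
    sums = [0.0] * depth
    for t in neighbors:
        for i, flag in enumerate(t["relevance"][engine][:depth]):
            sums[i] += flag
    return [s / len(neighbors) for s in sums]


def optimal_cutoffs(dists, n):
    """Choose lambda_s per engine, summing to n, maximizing expected relevance."""
    best = {0: (0.0, {})}  # pages used so far -> (expected relevant, cutoffs)
    for engine, dist in dists.items():
        prefix = [0.0]
        for v in dist:
            prefix.append(prefix[-1] + v)
        nxt = {}
        for used, (score, cuts) in best.items():
            for take in range(min(len(dist), n - used) + 1):
                cand = score + prefix[take]
                if used + take not in nxt or cand > nxt[used + take][0]:
                    nxt[used + take] = (cand, {**cuts, engine: take})
        best = nxt
    if n not in best:
        raise ValueError("engines cannot supply n pages at this depth")
    return best[n][1]


def fuse(results, cutoffs):
    """Union of the top lambda_s pages from each engine, duplicates dropped."""
    merged, seen = [], set()
    for engine, pages in results.items():
        for page in pages[:cutoffs.get(engine, 0)]:
            if page not in seen:
                seen.add(page)
                merged.append(page)
    return merged


# Hypothetical training data: relevance judgments by rank for each engine.
TRAINING = [
    {"query": "python tutorial",
     "relevance": {"A": [1, 1, 0, 0], "B": [1, 0, 0, 0]}},
    {"query": "python language guide",
     "relevance": {"A": [1, 1, 0, 0], "B": [0, 0, 0, 0]}},
    {"query": "cheap flights",
     "relevance": {"A": [0, 0, 0, 0], "B": [1, 1, 1, 1]}},
]

neighbors = nearest_queries("python language tutorial", TRAINING, k=2)
dists = {e: average_distribution(neighbors, e, depth=4) for e in ("A", "B")}
cutoffs = optimal_cutoffs(dists, n=3)
final = fuse({"A": ["a1", "a2", "a3"], "B": ["b1", "b2"]}, cutoffs)
```

The abstract does not say how the union is ordered, so this sketch simply keeps 
each engine's own order and lists engines one after another. Because duplicates are 
dropped, the fused list can be shorter than N when engines return the same page.
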